gh-144356: Avoid races when computing `set_iterator.__length_hint__` under no-gil by hyongtao-code · Pull Request #144357 · python/cpython

hyongtao-code · 2026-01-31T03:34:26Z

Long log:

setiter_len() was reading so->used without atomic access while concurrent
mutations update it atomically under Py_GIL_DISABLED.

In free-threaded builds, setiter_len() could race with concurrent set
mutation and iterator exhaustion.

Use an atomic load for so->used to avoid a data race. This preserves the
existing semantics of __length_hint__ while making the access thread-safe.

Signed-off-by: Yongtao Huang yongtaoh2022@gmail.com

Issue: Data race in set iterator length_hint under no-gil #144356

setiter_len() was reading so->used without atomic access while concurrent mutations update it atomically under Py_GIL_DISABLED. Use an atomic load for so->used to avoid a data race. This preserves the existing semantics of __length_hint__ while making the access thread-safe. Signed-off-by: Yongtao Huang <yongtaoh2022@gmail.com>

eendebakpt · 2026-01-31T18:45:03Z

Lib/test/test_free_threading/test_set.py

+        for t in threads:
+            t.start()
+
+        stop.set()


This means the threads will stop right after they have started. I would prefer the pattern that is used in some other tests in this file: set a constant NUM_LOOPS (determined so that the test < 0.1 seconds, but there still is a decent number of mutations)

eendebakpt · 2026-01-31T21:19:23Z

Objects/setobject.c

    setiterobject *si = (setiterobject*)op;
    Py_ssize_t len = 0;
-    if (si->si_set != NULL && si->si_used == si->si_set->used)
+    PySetObject *so = si->si_set;


Here so is a borrowed reference to si->si_set. But si->si_set can be cleared in setiter_iternext (if the iterator is exhausted) outside the critical section.

This is a different mechanism than the corresponding issue, so maybe something to address in another PR. But solving both together is something to consider.

Good catch. Thanks a lot.

hyongtao-code · 2026-02-01T16:21:39Z

Thanks for the review. I’ve decided to address both issues in this PR. I also added a corresponding test case for the issue you pointed out.

eendebakpt · 2026-02-01T21:02:28Z

Objects/setobject.c

    setiterobject *si = (setiterobject*)op;
    Py_ssize_t len = 0;
-    if (si->si_set != NULL && si->si_used == si->si_set->used)
+#ifdef Py_GIL_DISABLED


This might work for setiter_len, but setiter_iternext itself is not yet thread safe (also because of setting si->si_set to zero).

For several other iterations the approach is to keep the reference si->si_set , but use another attribute to signal exhaustion of the iterator. For example for itertools.cycle or the reversed operator.

Note: I tried creating a minimal example where concurrent iteration fails, but I have succeeded yet (the example does not crash, although I have not run thread sanitizer on it yet)

Test for concurrent iteration on set iterator

import unittest from threading import Thread, Barrier class TestSetIter(unittest.TestCase): def test_set_iter(self): """Test concurrent iteration over a set""" NUM_LOOPS = 10_000 NUM_THREADS = 4 for ii in range(NUM_LOOPS): if ii % 1000 ==0: print(f'test_set_iter {ii}') barrier = Barrier(NUM_THREADS) # make sure the underlying set is unique referenced by the iterator iterator = iter(set((1,2,))) def worker(): barrier.wait() while True: iterator.__length_hint__() try: next(iterator) except StopIteration: break threads = [Thread(target=worker) for _ in range(NUM_THREADS)] for t in threads: t.start() for t in threads: t.join() assert iterator.__length_hint__()==0 if __name__ == "__main__": unittest.main()

Thank you. I think your points make a lot of sense, and I really appreciate the two links you shared—they helped me get a more complete picture of the iterator-related data race.
I’ll try to construct the case you mentioned under a TSan environment.
If it turns out to be appropriate, we can address it fully in this PR, that would be great. Of course, this will take some time.

Yes, we should fix this like we have fixed others and as Sam suggested only clear the associated set in non-free-threading builds. The current code is incorrect because it uses try incref which can fail spuriously if the set object is not marked to enable try incref.

colesbury · 2026-02-02T18:58:18Z

Objects/setobject.c

    Py_END_CRITICAL_SECTION();
    si->si_pos = i+1;
    if (key == NULL) {
+#ifdef Py_GIL_DISABLED


I think we should follow the pattern that we use in other iterators: don't clear si->si_set when the iterator is exhausted in the free-threaded build.

That will keep other things simpler.

eendebakpt

I left some more review comments. I think we can get this right, but a simpler approach here would be to put a critical section on the set iterator itself.

eendebakpt · 2026-02-06T20:19:23Z

Lib/test/test_free_threading/test_set.py

+                barrier.wait()
+                barrier.wait()
+
+        t1 = Thread(target=advancer)


Please use the same style for starting and stopping threads as the other tests in this file (e.g. test_contains_hash_mutate)

eendebakpt · 2026-02-06T20:21:09Z

Objects/setobject.c

-        i++;
+    if (i < 0) {
+        /* iterator already exhausted */
+        exhausted = 1;


Suggested change

exhausted = 1;

return NULL;

(the exhausted variable is then not needed any more I believe)

You cannot directly return as it would skip ending the critical section

eendebakpt · 2026-02-06T20:22:53Z

Objects/setobject.c

+#ifdef Py_GIL_DISABLED
+            FT_ATOMIC_STORE_SSIZE_RELAXED(si->si_pos, i + 1);
+#else
+            si->si_pos = i + 1;
+#endif


Suggested change

#ifdef Py_GIL_DISABLED

FT_ATOMIC_STORE_SSIZE_RELAXED(si->si_pos, i + 1);

#else

si->si_pos = i + 1;

#endif

FT_ATOMIC_STORE_SSIZE_RELAXED(si->si_pos, i + 1);

On the normal build the macro will expand to si->si_pos = i + 1;

eendebakpt · 2026-02-06T20:23:44Z

Objects/setobject.c

+#ifdef Py_GIL_DISABLED
+            /* free-threaded: keep si_set; just mark exhausted */
+            FT_ATOMIC_STORE_SSIZE_RELAXED(si->si_pos, -1);
+            si->len = 0;


This (and some other places) should also be atomic?

eendebakpt · 2026-02-06T20:32:55Z

Objects/setobject.c

+        else {
+#ifdef Py_GIL_DISABLED
+            /* free-threaded: keep si_set; just mark exhausted */
+            FT_ATOMIC_STORE_SSIZE_RELAXED(si->si_pos, -1);


The value -1 written here could be overwritten by a concurrent thread (at line 1155). Which means that over exhaustion of the set iterator it is restored back to life. This does not lead to overflows or other issues (afaic), but is a bit odd behaviour.

eendebakpt · 2026-02-06T20:34:51Z

Objects/setobject.c

+
    if (key == NULL) {
-        si->si_set = NULL;
+#ifndef Py_GIL_DISABLED


I think in the normal build you still have to do si->si_set = NULL;, otherwise the si->si_set is decref'ed again in setiter_dealloc.

hyongtao-code added 2 commits January 31, 2026 10:51

Add test case

3e3785c

hyongtao-code requested a review from rhettinger as a code owner January 31, 2026 03:34

bedevere-app bot mentioned this pull request Jan 31, 2026

Data race in set iterator length_hint under no-gil #144356

Open

bedevere-app bot added the awaiting review label Jan 31, 2026

📜🤖 Added by blurb_it.

229ced3

eendebakpt reviewed Jan 31, 2026

View reviewed changes

hyongtao-code added 2 commits February 1, 2026 23:25

Resovle comments

cdcf88a

Update test case: used_race and exhaust_race

21f1478

hyongtao-code changed the title ~~gh-144356: fix data race in setiter_len() under no-gil~~ gh-144356: Avoid races when computing set_iterator.__length_hint__ under no-gil Feb 1, 2026

eendebakpt reviewed Feb 1, 2026

View reviewed changes

colesbury reviewed Feb 2, 2026

View reviewed changes

colesbury requested a review from kumaraditya303 February 2, 2026 18:58

hyongtao-code added 3 commits February 7, 2026 00:07

Add test case

a18c698

Try to address the comments

79b5fbc

Post fix

6ac15e0

eendebakpt reviewed Feb 6, 2026

View reviewed changes

Uh oh!

Conversation

hyongtao-code commented Jan 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

hyongtao-code commented Feb 1, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

eendebakpt left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

hyongtao-code commented Jan 31, 2026 •

edited

Loading